Dataset statistics
| Number of variables | 24 |
|---|---|
| Number of observations | 3664 |
| Missing cells | 2 |
| Missing cells (%) | < 0.1% |
| Duplicate rows | 357 |
| Duplicate rows (%) | 9.7% |
| Total size in memory | 687.1 KiB |
| Average record size in memory | 192.0 B |
Variable types
| Numeric | 16 |
|---|---|
| Categorical | 8 |
url_chinese_present has constant value "" | Constant |
html_num_tags('applet') has constant value "" | Constant |
| Dataset has 357 (9.7%) duplicate rows | Duplicates |
url_len is highly overall correlated with url_path_len and 1 other fields | High correlation |
url_num_hyphens_dom is highly overall correlated with url_domain_len and 1 other fields | High correlation |
url_path_len is highly overall correlated with url_len and 1 other fields | High correlation |
url_domain_len is highly overall correlated with url_num_hyphens_dom and 1 other fields | High correlation |
url_hostname_len is highly overall correlated with url_num_hyphens_dom and 1 other fields | High correlation |
url_query_len is highly overall correlated with url_num_query_para | High correlation |
url_num_query_para is highly overall correlated with url_query_len | High correlation |
url_entropy is highly overall correlated with url_len and 1 other fields | High correlation |
html_num_tags('script') is highly overall correlated with html_num_tags('div') and 1 other fields | High correlation |
html_num_tags('object') is highly overall correlated with html_num_tags('embed') | High correlation |
html_num_tags('div') is highly overall correlated with html_num_tags('script') and 2 other fields | High correlation |
html_num_tags('form') is highly overall correlated with html_num_tags('div') | High correlation |
html_num_tags('a') is highly overall correlated with html_num_tags('script') and 1 other fields | High correlation |
html_num_tags('embed') is highly overall correlated with html_num_tags('object') | High correlation |
url_ip_present is highly imbalanced (66.9%) | Imbalance |
url_port is highly imbalanced (97.8%) | Imbalance |
html_num_tags('embed') is highly imbalanced (92.0%) | Imbalance |
html_num_tags('head') is highly imbalanced (91.8%) | Imbalance |
html_num_tags('body') is highly imbalanced (83.9%) | Imbalance |
html_num_tags('div') is highly skewed (γ1 = 45.20523454) | Skewed |
html_num_tags('a') is highly skewed (γ1 = 32.57211778) | Skewed |
url_num_hyphens_dom has 2734 (74.6%) zeros | Zeros |
url_path_len has 626 (17.1%) zeros | Zeros |
url_num_underscores has 3206 (87.5%) zeros | Zeros |
url_query_len has 3436 (93.8%) zeros | Zeros |
url_num_query_para has 3471 (94.7%) zeros | Zeros |
html_num_tags('iframe') has 3172 (86.6%) zeros | Zeros |
html_num_tags('script') has 486 (13.3%) zeros | Zeros |
html_num_tags('object') has 3579 (97.7%) zeros | Zeros |
html_num_tags('div') has 328 (9.0%) zeros | Zeros |
html_num_tags('form') has 1191 (32.5%) zeros | Zeros |
html_num_tags('a') has 672 (18.3%) zeros | Zeros |
Reproduction
| Analysis started | 2023-03-07 08:28:07.486248 |
|---|---|
| Analysis finished | 2023-03-07 08:28:32.559433 |
| Duration | 25.07 seconds |
| Software version | ydata-profiling vv4.0.0 |
| Download configuration | config.json |
url_len
Real number (ℝ)
| Distinct | 242 |
|---|---|
| Distinct (%) | 6.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 54.559225 |
| Minimum | 6 |
|---|---|
| Maximum | 1837 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 28.8 KiB |
Quantile statistics
| Minimum | 6 |
|---|---|
| 5-th percentile | 15 |
| Q1 | 24 |
| median | 36 |
| Q3 | 55 |
| 95-th percentile | 157 |
| Maximum | 1837 |
| Range | 1831 |
| Interquartile range (IQR) | 31 |
Descriptive statistics
| Standard deviation | 82.493265 |
|---|---|
| Coefficient of variation (CV) | 1.5119948 |
| Kurtosis | 176.17286 |
| Mean | 54.559225 |
| Median Absolute Deviation (MAD) | 14 |
| Skewness | 10.797476 |
| Sum | 199905 |
| Variance | 6805.1387 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 17 | 147 | 4.0% |
| 34 | 117 | 3.2% |
| 39 | 101 | 2.8% |
| 18 | 97 | 2.6% |
| 37 | 91 | 2.5% |
| 26 | 90 | 2.5% |
| 23 | 84 | 2.3% |
| 32 | 84 | 2.3% |
| 24 | 83 | 2.3% |
| 33 | 83 | 2.3% |
| Other values (232) | 2687 |
| Value | Count | Frequency (%) |
| 6 | 1 | < 0.1% |
| 7 | 2 | 0.1% |
| 9 | 5 | 0.1% |
| 10 | 5 | 0.1% |
| 11 | 13 | 0.4% |
| 12 | 20 | 0.5% |
| 13 | 49 | |
| 14 | 65 | |
| 15 | 58 | |
| 16 | 70 |
| Value | Count | Frequency (%) |
| 1837 | 1 | |
| 1709 | 1 | |
| 1583 | 1 | |
| 1302 | 1 | |
| 1143 | 1 | |
| 1043 | 1 | |
| 952 | 1 | |
| 926 | 1 | |
| 880 | 1 | |
| 629 | 1 |
url_num_hyphens_dom
Real number (ℝ)
HIGH CORRELATION  ZEROS 
| Distinct | 8 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.41293668 |
| Minimum | 0 |
|---|---|
| Maximum | 14 |
| Zeros | 2734 |
| Zeros (%) | 74.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 28.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 1 |
| 95-th percentile | 2 |
| Maximum | 14 |
| Range | 14 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.84705763 |
|---|---|
| Coefficient of variation (CV) | 2.0513015 |
| Kurtosis | 22.93946 |
| Mean | 0.41293668 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 3.1968506 |
| Sum | 1513 |
| Variance | 0.71750663 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 2734 | |
| 1 | 509 | 13.9% |
| 2 | 315 | 8.6% |
| 3 | 71 | 1.9% |
| 4 | 26 | 0.7% |
| 5 | 5 | 0.1% |
| 6 | 3 | 0.1% |
| 14 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 2734 | |
| 1 | 509 | 13.9% |
| 2 | 315 | 8.6% |
| 3 | 71 | 1.9% |
| 4 | 26 | 0.7% |
| 5 | 5 | 0.1% |
| 6 | 3 | 0.1% |
| 14 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 14 | 1 | < 0.1% |
| 6 | 3 | 0.1% |
| 5 | 5 | 0.1% |
| 4 | 26 | 0.7% |
| 3 | 71 | 1.9% |
| 2 | 315 | 8.6% |
| 1 | 509 | 13.9% |
| 0 | 2734 |
url_path_len
Real number (ℝ)
HIGH CORRELATION  ZEROS 
| Distinct | 203 |
|---|---|
| Distinct (%) | 5.5% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 29.36582 |
| Minimum | 0 |
|---|---|
| Maximum | 1816 |
| Zeros | 626 |
| Zeros (%) | 17.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 28.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 13 |
| Q3 | 31 |
| 95-th percentile | 96 |
| Maximum | 1816 |
| Range | 1816 |
| Interquartile range (IQR) | 30 |
Descriptive statistics
| Standard deviation | 78.595248 |
|---|---|
| Coefficient of variation (CV) | 2.6764193 |
| Kurtosis | 218.09471 |
| Mean | 29.36582 |
| Median Absolute Deviation (MAD) | 12 |
| Skewness | 12.49738 |
| Sum | 107567 |
| Variance | 6177.2129 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 626 | 17.1% |
| 1 | 470 | 12.8% |
| 10 | 306 | 8.4% |
| 13 | 129 | 3.5% |
| 17 | 129 | 3.5% |
| 20 | 75 | 2.0% |
| 22 | 72 | 2.0% |
| 9 | 70 | 1.9% |
| 11 | 67 | 1.8% |
| 6 | 63 | 1.7% |
| Other values (193) | 1656 |
| Value | Count | Frequency (%) |
| 0 | 626 | |
| 1 | 470 | |
| 2 | 3 | 0.1% |
| 3 | 13 | 0.4% |
| 4 | 21 | 0.6% |
| 5 | 24 | 0.7% |
| 6 | 63 | 1.7% |
| 7 | 60 | 1.6% |
| 8 | 41 | 1.1% |
| 9 | 70 | 1.9% |
| Value | Count | Frequency (%) |
| 1816 | 1 | |
| 1690 | 1 | |
| 1566 | 1 | |
| 1286 | 1 | |
| 1127 | 1 | |
| 1022 | 1 | |
| 936 | 1 | |
| 910 | 1 | |
| 866 | 1 | |
| 607 | 1 |
url_domain_len
Real number (ℝ)
| Distinct | 67 |
|---|---|
| Distinct (%) | 1.8% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 20.383292 |
| Minimum | 4 |
|---|---|
| Maximum | 109 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 28.8 KiB |
Quantile statistics
| Minimum | 4 |
|---|---|
| 5-th percentile | 11 |
| Q1 | 14 |
| median | 17 |
| Q3 | 24 |
| 95-th percentile | 37 |
| Maximum | 109 |
| Range | 105 |
| Interquartile range (IQR) | 10 |
Descriptive statistics
| Standard deviation | 9.5970119 |
|---|---|
| Coefficient of variation (CV) | 0.47082737 |
| Kurtosis | 15.568031 |
| Mean | 20.383292 |
| Median Absolute Deviation (MAD) | 4 |
| Skewness | 2.805769 |
| Sum | 74664 |
| Variance | 92.102637 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 17 | 485 | 13.2% |
| 13 | 267 | 7.3% |
| 14 | 252 | 6.9% |
| 15 | 213 | 5.8% |
| 16 | 213 | 5.8% |
| 18 | 192 | 5.2% |
| 20 | 167 | 4.6% |
| 12 | 152 | 4.1% |
| 11 | 138 | 3.8% |
| 21 | 130 | 3.5% |
| Other values (57) | 1454 |
| Value | Count | Frequency (%) |
| 4 | 1 | < 0.1% |
| 5 | 5 | 0.1% |
| 6 | 6 | 0.2% |
| 7 | 24 | 0.7% |
| 8 | 10 | 0.3% |
| 9 | 44 | 1.2% |
| 10 | 50 | 1.4% |
| 11 | 138 | |
| 12 | 152 | |
| 13 | 267 |
| Value | Count | Frequency (%) |
| 109 | 1 | < 0.1% |
| 104 | 2 | |
| 103 | 1 | < 0.1% |
| 101 | 4 | |
| 100 | 1 | < 0.1% |
| 85 | 1 | < 0.1% |
| 77 | 1 | < 0.1% |
| 74 | 2 | |
| 72 | 1 | < 0.1% |
| 68 | 1 | < 0.1% |
url_hostname_len
Real number (ℝ)
| Distinct | 67 |
|---|---|
| Distinct (%) | 1.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 20.330513 |
| Minimum | 4 |
|---|---|
| Maximum | 109 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 28.8 KiB |
Quantile statistics
| Minimum | 4 |
|---|---|
| 5-th percentile | 11 |
| Q1 | 14 |
| median | 17 |
| Q3 | 24 |
| 95-th percentile | 37 |
| Maximum | 109 |
| Range | 105 |
| Interquartile range (IQR) | 10 |
Descriptive statistics
| Standard deviation | 9.6280931 |
|---|---|
| Coefficient of variation (CV) | 0.47357846 |
| Kurtosis | 15.392418 |
| Mean | 20.330513 |
| Median Absolute Deviation (MAD) | 4 |
| Skewness | 2.7904095 |
| Sum | 74491 |
| Variance | 92.700177 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 17 | 461 | 12.6% |
| 13 | 284 | 7.8% |
| 14 | 270 | 7.4% |
| 15 | 219 | 6.0% |
| 16 | 198 | 5.4% |
| 18 | 185 | 5.0% |
| 20 | 165 | 4.5% |
| 12 | 156 | 4.3% |
| 11 | 144 | 3.9% |
| 21 | 130 | 3.5% |
| Other values (57) | 1452 |
| Value | Count | Frequency (%) |
| 4 | 1 | < 0.1% |
| 5 | 5 | 0.1% |
| 6 | 6 | 0.2% |
| 7 | 24 | 0.7% |
| 8 | 10 | 0.3% |
| 9 | 44 | 1.2% |
| 10 | 50 | 1.4% |
| 11 | 144 | |
| 12 | 156 | |
| 13 | 284 |
| Value | Count | Frequency (%) |
| 109 | 1 | < 0.1% |
| 104 | 2 | |
| 103 | 1 | < 0.1% |
| 101 | 4 | |
| 100 | 1 | < 0.1% |
| 85 | 1 | < 0.1% |
| 77 | 1 | < 0.1% |
| 74 | 2 | |
| 72 | 1 | < 0.1% |
| 68 | 1 | < 0.1% |
url_num_dots
Real number (ℝ)
| Distinct | 17 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.5169214 |
| Minimum | 1 |
|---|---|
| Maximum | 32 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 28.8 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 2 |
| Q3 | 3 |
| 95-th percentile | 4 |
| Maximum | 32 |
| Range | 31 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.5341193 |
|---|---|
| Coefficient of variation (CV) | 0.60952212 |
| Kurtosis | 74.78281 |
| Mean | 2.5169214 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 5.7030461 |
| Sum | 9222 |
| Variance | 2.3535219 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2 | 1505 | |
| 3 | 952 | |
| 1 | 662 | |
| 4 | 383 | 10.5% |
| 5 | 53 | 1.4% |
| 6 | 50 | 1.4% |
| 8 | 15 | 0.4% |
| 7 | 13 | 0.4% |
| 9 | 10 | 0.3% |
| 11 | 5 | 0.1% |
| Other values (7) | 16 | 0.4% |
| Value | Count | Frequency (%) |
| 1 | 662 | |
| 2 | 1505 | |
| 3 | 952 | |
| 4 | 383 | 10.5% |
| 5 | 53 | 1.4% |
| 6 | 50 | 1.4% |
| 7 | 13 | 0.4% |
| 8 | 15 | 0.4% |
| 9 | 10 | 0.3% |
| 10 | 5 | 0.1% |
| Value | Count | Frequency (%) |
| 32 | 1 | < 0.1% |
| 26 | 2 | 0.1% |
| 16 | 1 | < 0.1% |
| 14 | 1 | < 0.1% |
| 13 | 5 | 0.1% |
| 12 | 1 | < 0.1% |
| 11 | 5 | 0.1% |
| 10 | 5 | 0.1% |
| 9 | 10 | |
| 8 | 15 |
url_num_underscores
Real number (ℝ)
| Distinct | 13 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.27265284 |
| Minimum | 0 |
|---|---|
| Maximum | 18 |
| Zeros | 3206 |
| Zeros (%) | 87.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 28.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 2 |
| Maximum | 18 |
| Range | 18 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 1.1245917 |
|---|---|
| Coefficient of variation (CV) | 4.1246286 |
| Kurtosis | 96.850391 |
| Mean | 0.27265284 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 8.5207928 |
| Sum | 999 |
| Variance | 1.2647065 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 3206 | |
| 1 | 257 | 7.0% |
| 2 | 87 | 2.4% |
| 3 | 57 | 1.6% |
| 4 | 29 | 0.8% |
| 6 | 7 | 0.2% |
| 14 | 6 | 0.2% |
| 5 | 5 | 0.1% |
| 12 | 4 | 0.1% |
| 10 | 2 | 0.1% |
| Other values (3) | 4 | 0.1% |
| Value | Count | Frequency (%) |
| 0 | 3206 | |
| 1 | 257 | 7.0% |
| 2 | 87 | 2.4% |
| 3 | 57 | 1.6% |
| 4 | 29 | 0.8% |
| 5 | 5 | 0.1% |
| 6 | 7 | 0.2% |
| 10 | 2 | 0.1% |
| 11 | 1 | < 0.1% |
| 12 | 4 | 0.1% |
| Value | Count | Frequency (%) |
| 18 | 2 | 0.1% |
| 15 | 1 | < 0.1% |
| 14 | 6 | 0.2% |
| 12 | 4 | 0.1% |
| 11 | 1 | < 0.1% |
| 10 | 2 | 0.1% |
| 6 | 7 | 0.2% |
| 5 | 5 | 0.1% |
| 4 | 29 | |
| 3 | 57 |
url_query_len
Real number (ℝ)
HIGH CORRELATION  ZEROS 
| Distinct | 78 |
|---|---|
| Distinct (%) | 2.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.7076965 |
| Minimum | 0 |
|---|---|
| Maximum | 429 |
| Zeros | 3436 |
| Zeros (%) | 93.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 28.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 27.85 |
| Maximum | 429 |
| Range | 429 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 25.318285 |
|---|---|
| Coefficient of variation (CV) | 5.3780622 |
| Kurtosis | 85.203526 |
| Mean | 4.7076965 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 8.0545636 |
| Sum | 17249 |
| Variance | 641.01555 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 3436 | |
| 41 | 50 | 1.4% |
| 157 | 21 | 0.6% |
| 5 | 12 | 0.3% |
| 11 | 8 | 0.2% |
| 44 | 6 | 0.2% |
| 165 | 6 | 0.2% |
| 45 | 5 | 0.1% |
| 70 | 5 | 0.1% |
| 36 | 4 | 0.1% |
| Other values (68) | 111 | 3.0% |
| Value | Count | Frequency (%) |
| 0 | 3436 | |
| 5 | 12 | 0.3% |
| 6 | 2 | 0.1% |
| 9 | 1 | < 0.1% |
| 11 | 8 | 0.2% |
| 13 | 1 | < 0.1% |
| 15 | 4 | 0.1% |
| 16 | 3 | 0.1% |
| 17 | 2 | 0.1% |
| 18 | 2 | 0.1% |
| Value | Count | Frequency (%) |
| 429 | 2 | |
| 350 | 1 | |
| 312 | 1 | |
| 289 | 1 | |
| 271 | 1 | |
| 248 | 1 | |
| 208 | 1 | |
| 200 | 2 | |
| 185 | 1 | |
| 173 | 1 |
url_num_query_para
Real number (ℝ)
HIGH CORRELATION  ZEROS 
| Distinct | 9 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.10425764 |
| Minimum | 0 |
|---|---|
| Maximum | 9 |
| Zeros | 3471 |
| Zeros (%) | 94.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 28.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 1 |
| Maximum | 9 |
| Range | 9 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.57430977 |
|---|---|
| Coefficient of variation (CV) | 5.5085629 |
| Kurtosis | 93.849177 |
| Mean | 0.10425764 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 8.5653553 |
| Sum | 382 |
| Variance | 0.32983172 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 3471 | |
| 1 | 105 | 2.9% |
| 2 | 41 | 1.1% |
| 3 | 33 | 0.9% |
| 6 | 5 | 0.1% |
| 7 | 4 | 0.1% |
| 9 | 2 | 0.1% |
| 8 | 2 | 0.1% |
| 4 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 3471 | |
| 1 | 105 | 2.9% |
| 2 | 41 | 1.1% |
| 3 | 33 | 0.9% |
| 4 | 1 | < 0.1% |
| 6 | 5 | 0.1% |
| 7 | 4 | 0.1% |
| 8 | 2 | 0.1% |
| 9 | 2 | 0.1% |
| Value | Count | Frequency (%) |
| 9 | 2 | 0.1% |
| 8 | 2 | 0.1% |
| 7 | 4 | 0.1% |
| 6 | 5 | 0.1% |
| 4 | 1 | < 0.1% |
| 3 | 33 | 0.9% |
| 2 | 41 | 1.1% |
| 1 | 105 | 2.9% |
| 0 | 3471 |
url_ip_present
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 28.8 KiB |
| 0.0 | |
|---|---|
| 1.0 | 223 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 10992 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 0.0 |
| 4th row | 0.0 |
| 5th row | 0.0 |
Common Values
| Value | Count | Frequency (%) |
| 0.0 | 3441 | |
| 1.0 | 223 | 6.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0.0 | 3441 | |
| 1.0 | 223 | 6.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 7105 | |
| . | 3664 | |
| 1 | 223 | 2.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 7328 | |
| Other Punctuation | 3664 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 7105 | |
| 1 | 223 | 3.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 3664 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 10992 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 7105 | |
| . | 3664 | |
| 1 | 223 | 2.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 10992 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 7105 | |
| . | 3664 | |
| 1 | 223 | 2.0% |
url_entropy
Real number (ℝ)
| Distinct | 2524 |
|---|---|
| Distinct (%) | 68.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.2285684 |
| Minimum | 2.7378394 |
|---|---|
| Maximum | 5.8217821 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 28.8 KiB |
Quantile statistics
| Minimum | 2.7378394 |
|---|---|
| 5-th percentile | 3.6197263 |
| Q1 | 3.9831956 |
| median | 4.1895611 |
| Q3 | 4.4589405 |
| 95-th percentile | 4.9033082 |
| Maximum | 5.8217821 |
| Range | 3.0839426 |
| Interquartile range (IQR) | 0.47574484 |
Descriptive statistics
| Standard deviation | 0.3930554 |
|---|---|
| Coefficient of variation (CV) | 0.092952357 |
| Kurtosis | 0.69692091 |
| Mean | 4.2285684 |
| Median Absolute Deviation (MAD) | 0.22579503 |
| Skewness | 0.30399941 |
| Sum | 15493.475 |
| Variance | 0.15449254 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3.970175521 | 18 | 0.5% |
| 4.084962501 | 14 | 0.4% |
| 3.886842188 | 13 | 0.4% |
| 3.97366069 | 12 | 0.3% |
| 3.788754914 | 11 | 0.3% |
| 3.938721876 | 11 | 0.3% |
| 4.053508855 | 11 | 0.3% |
| 3.558518613 | 10 | 0.3% |
| 3.689703732 | 10 | 0.3% |
| 4.168295834 | 9 | 0.2% |
| Other values (2514) | 3545 |
| Value | Count | Frequency (%) |
| 2.737839416 | 1 | |
| 2.819808339 | 1 | |
| 2.971860874 | 1 | |
| 3.012015896 | 1 | |
| 3.019765516 | 1 | |
| 3.074515896 | 2 | |
| 3.077323802 | 1 | |
| 3.103701696 | 1 | |
| 3.127986807 | 1 | |
| 3.137015896 | 1 |
| Value | Count | Frequency (%) |
| 5.821782065 | 1 | |
| 5.815521588 | 1 | |
| 5.676410099 | 1 | |
| 5.65563894 | 1 | |
| 5.64727204 | 1 | |
| 5.645037892 | 1 | |
| 5.624739025 | 2 | |
| 5.612819605 | 1 | |
| 5.567501007 | 1 | |
| 5.566057159 | 1 |
url_chinese_present
Categorical
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 28.8 KiB |
| 0.0 |
|---|
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 10992 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 0.0 |
| 4th row | 0.0 |
| 5th row | 0.0 |
Common Values
| Value | Count | Frequency (%) |
| 0.0 | 3664 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0.0 | 3664 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 7328 | |
| . | 3664 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 7328 | |
| Other Punctuation | 3664 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 7328 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 3664 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 10992 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 7328 | |
| . | 3664 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 10992 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 7328 | |
| . | 3664 |
url_port
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 28.8 KiB |
| 0.0 | |
|---|---|
| 1.0 | 8 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 10992 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 0.0 |
| 4th row | 0.0 |
| 5th row | 0.0 |
Common Values
| Value | Count | Frequency (%) |
| 0.0 | 3656 | |
| 1.0 | 8 | 0.2% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0.0 | 3656 | |
| 1.0 | 8 | 0.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 7320 | |
| . | 3664 | |
| 1 | 8 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 7328 | |
| Other Punctuation | 3664 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 7320 | |
| 1 | 8 | 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 3664 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 10992 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 7320 | |
| . | 3664 | |
| 1 | 8 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 10992 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 7320 | |
| . | 3664 | |
| 1 | 8 | 0.1% |
html_num_tags('iframe')
Real number (ℝ)
| Distinct | 14 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.22079694 |
| Minimum | 0 |
|---|---|
| Maximum | 26 |
| Zeros | 3172 |
| Zeros (%) | 86.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 28.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 1 |
| Maximum | 26 |
| Range | 26 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.89838258 |
|---|---|
| Coefficient of variation (CV) | 4.068818 |
| Kurtosis | 254.94884 |
| Mean | 0.22079694 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 12.35457 |
| Sum | 809 |
| Variance | 0.80709126 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 3172 | |
| 1 | 365 | 10.0% |
| 2 | 59 | 1.6% |
| 3 | 41 | 1.1% |
| 4 | 8 | 0.2% |
| 5 | 6 | 0.2% |
| 8 | 3 | 0.1% |
| 12 | 2 | 0.1% |
| 6 | 2 | 0.1% |
| 10 | 2 | 0.1% |
| Other values (4) | 4 | 0.1% |
| Value | Count | Frequency (%) |
| 0 | 3172 | |
| 1 | 365 | 10.0% |
| 2 | 59 | 1.6% |
| 3 | 41 | 1.1% |
| 4 | 8 | 0.2% |
| 5 | 6 | 0.2% |
| 6 | 2 | 0.1% |
| 7 | 1 | < 0.1% |
| 8 | 3 | 0.1% |
| 10 | 2 | 0.1% |
| Value | Count | Frequency (%) |
| 26 | 1 | < 0.1% |
| 17 | 1 | < 0.1% |
| 12 | 2 | 0.1% |
| 11 | 1 | < 0.1% |
| 10 | 2 | 0.1% |
| 8 | 3 | 0.1% |
| 7 | 1 | < 0.1% |
| 6 | 2 | 0.1% |
| 5 | 6 | |
| 4 | 8 |
html_num_tags('script')
Real number (ℝ)
HIGH CORRELATION  ZEROS 
| Distinct | 78 |
|---|---|
| Distinct (%) | 2.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 8.7854803 |
| Minimum | 0 |
|---|---|
| Maximum | 267 |
| Zeros | 486 |
| Zeros (%) | 13.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 28.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 2 |
| median | 4 |
| Q3 | 12 |
| 95-th percentile | 30 |
| Maximum | 267 |
| Range | 267 |
| Interquartile range (IQR) | 10 |
Descriptive statistics
| Standard deviation | 12.647356 |
|---|---|
| Coefficient of variation (CV) | 1.4395748 |
| Kurtosis | 69.093899 |
| Mean | 8.7854803 |
| Median Absolute Deviation (MAD) | 4 |
| Skewness | 5.5378516 |
| Sum | 32190 |
| Variance | 159.95561 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2 | 646 | |
| 0 | 486 | |
| 1 | 355 | 9.7% |
| 3 | 232 | 6.3% |
| 9 | 184 | 5.0% |
| 8 | 172 | 4.7% |
| 4 | 153 | 4.2% |
| 17 | 133 | 3.6% |
| 5 | 121 | 3.3% |
| 6 | 109 | 3.0% |
| Other values (68) | 1073 |
| Value | Count | Frequency (%) |
| 0 | 486 | |
| 1 | 355 | |
| 2 | 646 | |
| 3 | 232 | 6.3% |
| 4 | 153 | 4.2% |
| 5 | 121 | 3.3% |
| 6 | 109 | 3.0% |
| 7 | 100 | 2.7% |
| 8 | 172 | 4.7% |
| 9 | 184 | 5.0% |
| Value | Count | Frequency (%) |
| 267 | 1 | |
| 174 | 1 | |
| 140 | 1 | |
| 129 | 2 | |
| 108 | 1 | |
| 104 | 1 | |
| 98 | 1 | |
| 97 | 1 | |
| 90 | 1 | |
| 87 | 2 |
html_num_tags('embed')
Categorical
HIGH CORRELATION  IMBALANCE 
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 28.8 KiB |
| 0.0 | |
|---|---|
| 1.0 | 60 |
| 3.0 | 2 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 10992 |
|---|---|
| Distinct characters | 4 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 0.0 |
| 4th row | 0.0 |
| 5th row | 0.0 |
Common Values
| Value | Count | Frequency (%) |
| 0.0 | 3602 | |
| 1.0 | 60 | 1.6% |
| 3.0 | 2 | 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0.0 | 3602 | |
| 1.0 | 60 | 1.6% |
| 3.0 | 2 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 7266 | |
| . | 3664 | |
| 1 | 60 | 0.5% |
| 3 | 2 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 7328 | |
| Other Punctuation | 3664 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 7266 | |
| 1 | 60 | 0.8% |
| 3 | 2 | < 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 3664 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 10992 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 7266 | |
| . | 3664 | |
| 1 | 60 | 0.5% |
| 3 | 2 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 10992 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 7266 | |
| . | 3664 | |
| 1 | 60 | 0.5% |
| 3 | 2 | < 0.1% |
html_num_tags('object')
Real number (ℝ)
HIGH CORRELATION  ZEROS 
| Distinct | 7 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.028930131 |
| Minimum | 0 |
|---|---|
| Maximum | 8 |
| Zeros | 3579 |
| Zeros (%) | 97.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 28.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 8 |
| Range | 8 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.2376823 |
|---|---|
| Coefficient of variation (CV) | 8.2157354 |
| Kurtosis | 435.46225 |
| Mean | 0.028930131 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 16.802124 |
| Sum | 106 |
| Variance | 0.056492876 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 3579 | |
| 1 | 76 | 2.1% |
| 2 | 5 | 0.1% |
| 3 | 1 | < 0.1% |
| 8 | 1 | < 0.1% |
| 4 | 1 | < 0.1% |
| 5 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 3579 | |
| 1 | 76 | 2.1% |
| 2 | 5 | 0.1% |
| 3 | 1 | < 0.1% |
| 4 | 1 | < 0.1% |
| 5 | 1 | < 0.1% |
| 8 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 8 | 1 | < 0.1% |
| 5 | 1 | < 0.1% |
| 4 | 1 | < 0.1% |
| 3 | 1 | < 0.1% |
| 2 | 5 | 0.1% |
| 1 | 76 | 2.1% |
| 0 | 3579 |
html_num_tags('div')
Real number (ℝ)
HIGH CORRELATION  SKEWED  ZEROS 
| Distinct | 302 |
|---|---|
| Distinct (%) | 8.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 70.843886 |
| Minimum | 0 |
|---|---|
| Maximum | 19941 |
| Zeros | 328 |
| Zeros (%) | 9.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 28.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 6 |
| median | 33 |
| Q3 | 62 |
| 95-th percentile | 251 |
| Maximum | 19941 |
| Range | 19941 |
| Interquartile range (IQR) | 56 |
Descriptive statistics
| Standard deviation | 365.5933 |
|---|---|
| Coefficient of variation (CV) | 5.1605484 |
| Kurtosis | 2399.0481 |
| Mean | 70.843886 |
| Median Absolute Deviation (MAD) | 28 |
| Skewness | 45.205235 |
| Sum | 259572 |
| Variance | 133658.46 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 328 | 9.0% |
| 41 | 322 | 8.8% |
| 4 | 148 | 4.0% |
| 1 | 144 | 3.9% |
| 2 | 136 | 3.7% |
| 8 | 93 | 2.5% |
| 36 | 80 | 2.2% |
| 3 | 74 | 2.0% |
| 5 | 72 | 2.0% |
| 32 | 71 | 1.9% |
| Other values (292) | 2196 |
| Value | Count | Frequency (%) |
| 0 | 328 | |
| 1 | 144 | |
| 2 | 136 | |
| 3 | 74 | 2.0% |
| 4 | 148 | |
| 5 | 72 | 2.0% |
| 6 | 37 | 1.0% |
| 7 | 31 | 0.8% |
| 8 | 93 | 2.5% |
| 9 | 63 | 1.7% |
| Value | Count | Frequency (%) |
| 19941 | 1 | < 0.1% |
| 5511 | 1 | < 0.1% |
| 2992 | 1 | < 0.1% |
| 2087 | 4 | |
| 1999 | 1 | < 0.1% |
| 1604 | 1 | < 0.1% |
| 1234 | 1 | < 0.1% |
| 1122 | 1 | < 0.1% |
| 956 | 2 | |
| 950 | 1 | < 0.1% |
html_num_tags('head')
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 28.8 KiB |
| 1.0 | |
|---|---|
| 0.0 | 41 |
| 2.0 | 32 |
| 3.0 | 1 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 10992 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 1.0 |
|---|---|
| 2nd row | 1.0 |
| 3rd row | 1.0 |
| 4th row | 1.0 |
| 5th row | 1.0 |
Common Values
| Value | Count | Frequency (%) |
| 1.0 | 3590 | |
| 0.0 | 41 | 1.1% |
| 2.0 | 32 | 0.9% |
| 3.0 | 1 | < 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1.0 | 3590 | |
| 0.0 | 41 | 1.1% |
| 2.0 | 32 | 0.9% |
| 3.0 | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 3705 | |
| . | 3664 | |
| 1 | 3590 | |
| 2 | 32 | 0.3% |
| 3 | 1 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 7328 | |
| Other Punctuation | 3664 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 3705 | |
| 1 | 3590 | |
| 2 | 32 | 0.4% |
| 3 | 1 | < 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 3664 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 10992 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 3705 | |
| . | 3664 | |
| 1 | 3590 | |
| 2 | 32 | 0.3% |
| 3 | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 10992 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 3705 | |
| . | 3664 | |
| 1 | 3590 | |
| 2 | 32 | 0.3% |
| 3 | 1 | < 0.1% |
html_num_tags('body')
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 28.8 KiB |
| 1.0 | |
|---|---|
| 2.0 | 106 |
| 0.0 | 57 |
| 3.0 | 6 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 10992 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1.0 |
|---|---|
| 2nd row | 1.0 |
| 3rd row | 1.0 |
| 4th row | 1.0 |
| 5th row | 1.0 |
Common Values
| Value | Count | Frequency (%) |
| 1.0 | 3495 | |
| 2.0 | 106 | 2.9% |
| 0.0 | 57 | 1.6% |
| 3.0 | 6 | 0.2% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1.0 | 3495 | |
| 2.0 | 106 | 2.9% |
| 0.0 | 57 | 1.6% |
| 3.0 | 6 | 0.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 3721 | |
| . | 3664 | |
| 1 | 3495 | |
| 2 | 106 | 1.0% |
| 3 | 6 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 7328 | |
| Other Punctuation | 3664 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 3721 | |
| 1 | 3495 | |
| 2 | 106 | 1.4% |
| 3 | 6 | 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 3664 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 10992 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 3721 | |
| . | 3664 | |
| 1 | 3495 | |
| 2 | 106 | 1.0% |
| 3 | 6 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 10992 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 3721 | |
| . | 3664 | |
| 1 | 3495 | |
| 2 | 106 | 1.0% |
| 3 | 6 | 0.1% |
html_num_tags('form')
Real number (ℝ)
HIGH CORRELATION  ZEROS 
| Distinct | 14 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.0289301 |
| Minimum | 0 |
|---|---|
| Maximum | 57 |
| Zeros | 1191 |
| Zeros (%) | 32.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 28.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 1 |
| Q3 | 1 |
| 95-th percentile | 3 |
| Maximum | 57 |
| Range | 57 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.5639026 |
|---|---|
| Coefficient of variation (CV) | 1.5199308 |
| Kurtosis | 472.44134 |
| Mean | 1.0289301 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 15.345319 |
| Sum | 3770 |
| Variance | 2.4457913 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 1790 | |
| 0 | 1191 | |
| 2 | 433 | 11.8% |
| 3 | 132 | 3.6% |
| 4 | 49 | 1.3% |
| 5 | 40 | 1.1% |
| 7 | 12 | 0.3% |
| 19 | 5 | 0.1% |
| 8 | 4 | 0.1% |
| 6 | 4 | 0.1% |
| Other values (4) | 4 | 0.1% |
| Value | Count | Frequency (%) |
| 0 | 1191 | |
| 1 | 1790 | |
| 2 | 433 | 11.8% |
| 3 | 132 | 3.6% |
| 4 | 49 | 1.3% |
| 5 | 40 | 1.1% |
| 6 | 4 | 0.1% |
| 7 | 12 | 0.3% |
| 8 | 4 | 0.1% |
| 9 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 57 | 1 | < 0.1% |
| 19 | 5 | 0.1% |
| 11 | 1 | < 0.1% |
| 10 | 1 | < 0.1% |
| 9 | 1 | < 0.1% |
| 8 | 4 | 0.1% |
| 7 | 12 | 0.3% |
| 6 | 4 | 0.1% |
| 5 | 40 | |
| 4 | 49 |
html_num_tags('a')
Real number (ℝ)
HIGH CORRELATION  SKEWED  ZEROS 
| Distinct | 300 |
|---|---|
| Distinct (%) | 8.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 66.18286 |
| Minimum | 0 |
|---|---|
| Maximum | 13451 |
| Zeros | 672 |
| Zeros (%) | 18.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 28.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 2 |
| median | 16 |
| Q3 | 52 |
| 95-th percentile | 238 |
| Maximum | 13451 |
| Range | 13451 |
| Interquartile range (IQR) | 50 |
Descriptive statistics
| Standard deviation | 342.65146 |
|---|---|
| Coefficient of variation (CV) | 5.1773444 |
| Kurtosis | 1244.8768 |
| Mean | 66.18286 |
| Median Absolute Deviation (MAD) | 16 |
| Skewness | 32.572118 |
| Sum | 242494 |
| Variance | 117410.02 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 672 | 18.3% |
| 3 | 214 | 5.8% |
| 16 | 208 | 5.7% |
| 1 | 186 | 5.1% |
| 15 | 118 | 3.2% |
| 4 | 117 | 3.2% |
| 18 | 110 | 3.0% |
| 2 | 107 | 2.9% |
| 29 | 85 | 2.3% |
| 5 | 80 | 2.2% |
| Other values (290) | 1767 |
| Value | Count | Frequency (%) |
| 0 | 672 | |
| 1 | 186 | 5.1% |
| 2 | 107 | 2.9% |
| 3 | 214 | 5.8% |
| 4 | 117 | 3.2% |
| 5 | 80 | 2.2% |
| 6 | 35 | 1.0% |
| 7 | 60 | 1.6% |
| 8 | 45 | 1.2% |
| 9 | 37 | 1.0% |
| Value | Count | Frequency (%) |
| 13451 | 1 | < 0.1% |
| 13298 | 1 | < 0.1% |
| 2664 | 1 | < 0.1% |
| 2557 | 1 | < 0.1% |
| 2501 | 1 | < 0.1% |
| 2053 | 1 | < 0.1% |
| 1526 | 1 | < 0.1% |
| 1315 | 4 | |
| 1236 | 1 | < 0.1% |
| 909 | 1 | < 0.1% |
html_num_tags('applet')
Categorical
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 28.8 KiB |
| 0.0 |
|---|
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 10992 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 0.0 |
| 4th row | 0.0 |
| 5th row | 0.0 |
Common Values
| Value | Count | Frequency (%) |
| 0.0 | 3664 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0.0 | 3664 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 7328 | |
| . | 3664 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 7328 | |
| Other Punctuation | 3664 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 7328 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 3664 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 10992 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 7328 | |
| . | 3664 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 10992 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 7328 | |
| . | 3664 |
label
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 28.8 KiB |
| benign | |
|---|---|
| malicious |
Length
| Max length | 9 |
|---|---|
| Median length | 6 |
| Mean length | 7.4787118 |
| Min length | 6 |
Characters and Unicode
| Total characters | 27402 |
|---|---|
| Distinct characters | 12 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | malicious |
|---|---|
| 2nd row | benign |
| 3rd row | benign |
| 4th row | benign |
| 5th row | benign |
Common Values
| Value | Count | Frequency (%) |
| benign | 1858 | |
| malicious | 1806 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| benign | 1858 | |
| malicious | 1806 |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 5470 | |
| n | 3716 | |
| b | 1858 | 6.8% |
| e | 1858 | 6.8% |
| g | 1858 | 6.8% |
| m | 1806 | 6.6% |
| a | 1806 | 6.6% |
| l | 1806 | 6.6% |
| c | 1806 | 6.6% |
| o | 1806 | 6.6% |
| Other values (2) | 3612 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 27402 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 5470 | |
| n | 3716 | |
| b | 1858 | 6.8% |
| e | 1858 | 6.8% |
| g | 1858 | 6.8% |
| m | 1806 | 6.6% |
| a | 1806 | 6.6% |
| l | 1806 | 6.6% |
| c | 1806 | 6.6% |
| o | 1806 | 6.6% |
| Other values (2) | 3612 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 27402 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 5470 | |
| n | 3716 | |
| b | 1858 | 6.8% |
| e | 1858 | 6.8% |
| g | 1858 | 6.8% |
| m | 1806 | 6.6% |
| a | 1806 | 6.6% |
| l | 1806 | 6.6% |
| c | 1806 | 6.6% |
| o | 1806 | 6.6% |
| Other values (2) | 3612 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 27402 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 5470 | |
| n | 3716 | |
| b | 1858 | 6.8% |
| e | 1858 | 6.8% |
| g | 1858 | 6.8% |
| m | 1806 | 6.6% |
| a | 1806 | 6.6% |
| l | 1806 | 6.6% |
| c | 1806 | 6.6% |
| o | 1806 | 6.6% |
| Other values (2) | 3612 |
| url_len | url_num_hyphens_dom | url_path_len | url_domain_len | url_hostname_len | url_num_dots | url_num_underscores | url_query_len | url_num_query_para | url_entropy | html_num_tags('iframe') | html_num_tags('script') | html_num_tags('object') | html_num_tags('div') | html_num_tags('form') | html_num_tags('a') | url_ip_present | url_port | html_num_tags('embed') | html_num_tags('head') | html_num_tags('body') | label | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| url_len | 1.000 | 0.077 | 0.847 | 0.173 | 0.180 | 0.399 | 0.404 | 0.347 | 0.335 | 0.815 | 0.004 | -0.204 | 0.002 | -0.211 | -0.010 | -0.229 | 0.000 | 0.088 | 0.000 | 0.079 | 0.136 | 0.060 |
| url_num_hyphens_dom | 0.077 | 1.000 | -0.192 | 0.553 | 0.556 | -0.141 | -0.119 | 0.088 | 0.075 | 0.092 | -0.071 | -0.137 | -0.072 | 0.156 | 0.137 | 0.058 | 0.084 | 0.000 | 0.000 | 0.031 | 0.071 | 0.280 |
| url_path_len | 0.847 | -0.192 | 1.000 | -0.234 | -0.227 | 0.368 | 0.429 | 0.150 | 0.137 | 0.688 | 0.029 | -0.123 | 0.024 | -0.267 | -0.036 | -0.223 | 0.000 | 0.161 | 0.000 | 0.083 | 0.032 | 0.107 |
| url_domain_len | 0.173 | 0.553 | -0.234 | 1.000 | 0.995 | 0.010 | -0.171 | 0.072 | 0.073 | 0.157 | -0.030 | -0.072 | -0.004 | 0.134 | 0.149 | 0.044 | 0.169 | 0.000 | 0.050 | 0.060 | 0.084 | 0.377 |
| url_hostname_len | 0.180 | 0.556 | -0.227 | 0.995 | 1.000 | -0.003 | -0.169 | 0.075 | 0.076 | 0.164 | -0.029 | -0.060 | -0.010 | 0.147 | 0.160 | 0.057 | 0.257 | 0.000 | 0.053 | 0.061 | 0.082 | 0.378 |
| url_num_dots | 0.399 | -0.141 | 0.368 | 0.010 | -0.003 | 1.000 | 0.175 | 0.097 | 0.099 | 0.333 | 0.017 | -0.179 | -0.015 | -0.220 | -0.141 | -0.163 | 0.046 | 0.000 | 0.000 | 0.000 | 0.027 | 0.076 |
| url_num_underscores | 0.404 | -0.119 | 0.429 | -0.171 | -0.169 | 0.175 | 1.000 | 0.173 | 0.168 | 0.351 | -0.023 | -0.119 | -0.041 | -0.194 | -0.177 | -0.143 | 0.032 | 0.397 | 0.000 | 0.092 | 0.015 | 0.144 |
| url_query_len | 0.347 | 0.088 | 0.150 | 0.072 | 0.075 | 0.097 | 0.173 | 1.000 | 0.922 | 0.360 | -0.040 | -0.129 | -0.024 | -0.015 | 0.042 | -0.097 | 0.000 | 0.000 | 0.000 | 0.000 | 0.207 | 0.177 |
| url_num_query_para | 0.335 | 0.075 | 0.137 | 0.073 | 0.076 | 0.099 | 0.168 | 0.922 | 1.000 | 0.345 | -0.051 | -0.139 | -0.020 | 0.005 | 0.034 | -0.077 | 0.038 | 0.000 | 0.000 | 0.000 | 0.225 | 0.235 |
| url_entropy | 0.815 | 0.092 | 0.688 | 0.157 | 0.164 | 0.333 | 0.351 | 0.360 | 0.345 | 1.000 | -0.016 | -0.167 | -0.015 | -0.124 | 0.024 | -0.145 | 0.271 | 0.056 | 0.025 | 0.037 | 0.146 | 0.256 |
| html_num_tags('iframe') | 0.004 | -0.071 | 0.029 | -0.030 | -0.029 | 0.017 | -0.023 | -0.040 | -0.051 | -0.016 | 1.000 | 0.307 | 0.136 | 0.245 | 0.198 | 0.270 | 0.000 | 0.000 | 0.000 | 0.032 | 0.000 | 0.071 |
| html_num_tags('script') | -0.204 | -0.137 | -0.123 | -0.072 | -0.060 | -0.179 | -0.119 | -0.129 | -0.139 | -0.167 | 0.307 | 1.000 | 0.036 | 0.555 | 0.386 | 0.595 | 0.035 | 0.000 | 0.000 | 0.200 | 0.073 | 0.087 |
| html_num_tags('object') | 0.002 | -0.072 | 0.024 | -0.004 | -0.010 | -0.015 | -0.041 | -0.024 | -0.020 | -0.015 | 0.136 | 0.036 | 1.000 | 0.005 | -0.048 | 0.102 | 0.076 | 0.000 | 0.753 | 0.000 | 0.099 | 0.120 |
| html_num_tags('div') | -0.211 | 0.156 | -0.267 | 0.134 | 0.147 | -0.220 | -0.194 | -0.015 | 0.005 | -0.124 | 0.245 | 0.555 | 0.005 | 1.000 | 0.532 | 0.819 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.036 |
| html_num_tags('form') | -0.010 | 0.137 | -0.036 | 0.149 | 0.160 | -0.141 | -0.177 | 0.042 | 0.034 | 0.024 | 0.198 | 0.386 | -0.048 | 0.532 | 1.000 | 0.457 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.035 |
| html_num_tags('a') | -0.229 | 0.058 | -0.223 | 0.044 | 0.057 | -0.163 | -0.143 | -0.097 | -0.077 | -0.145 | 0.270 | 0.595 | 0.102 | 0.819 | 0.457 | 1.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.036 |
| url_ip_present | 0.000 | 0.084 | 0.000 | 0.169 | 0.257 | 0.046 | 0.032 | 0.000 | 0.038 | 0.271 | 0.000 | 0.035 | 0.076 | 0.000 | 0.000 | 0.000 | 1.000 | 0.097 | 0.024 | 0.009 | 0.035 | 0.077 |
| url_port | 0.088 | 0.000 | 0.161 | 0.000 | 0.000 | 0.000 | 0.397 | 0.000 | 0.000 | 0.056 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.097 | 1.000 | 0.000 | 0.000 | 0.000 | 0.000 |
| html_num_tags('embed') | 0.000 | 0.000 | 0.000 | 0.050 | 0.053 | 0.000 | 0.000 | 0.000 | 0.000 | 0.025 | 0.000 | 0.000 | 0.753 | 0.000 | 0.000 | 0.000 | 0.024 | 0.000 | 1.000 | 0.000 | 0.000 | 0.110 |
| html_num_tags('head') | 0.079 | 0.031 | 0.083 | 0.060 | 0.061 | 0.000 | 0.092 | 0.000 | 0.000 | 0.037 | 0.032 | 0.200 | 0.000 | 0.000 | 0.000 | 0.000 | 0.009 | 0.000 | 0.000 | 1.000 | 0.275 | 0.000 |
| html_num_tags('body') | 0.136 | 0.071 | 0.032 | 0.084 | 0.082 | 0.027 | 0.015 | 0.207 | 0.225 | 0.146 | 0.000 | 0.073 | 0.099 | 0.000 | 0.000 | 0.000 | 0.035 | 0.000 | 0.000 | 0.275 | 1.000 | 0.133 |
| label | 0.060 | 0.280 | 0.107 | 0.377 | 0.378 | 0.076 | 0.144 | 0.177 | 0.235 | 0.256 | 0.071 | 0.087 | 0.120 | 0.036 | 0.035 | 0.036 | 0.077 | 0.000 | 0.110 | 0.000 | 0.133 | 1.000 |
| url_len | url_num_hyphens_dom | url_path_len | url_domain_len | url_hostname_len | url_num_dots | url_num_underscores | url_query_len | url_num_query_para | url_ip_present | url_entropy | url_chinese_present | url_port | html_num_tags('iframe') | html_num_tags('script') | html_num_tags('embed') | html_num_tags('object') | html_num_tags('div') | html_num_tags('head') | html_num_tags('body') | html_num_tags('form') | html_num_tags('a') | html_num_tags('applet') | label | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 23.0 | 0.0 | 8.0 | 15.0 | 15.0 | 2.0 | 0.0 | 0.0 | 0.0 | 0.0 | 4.260333 | 0.0 | 0.0 | 0.0 | 7.0 | 0.0 | 0.0 | 0.0 | 1.0 | 1.0 | 0.0 | 0.0 | 0.0 | malicious |
| 1 | 75.0 | 0.0 | 58.0 | 17.0 | 17.0 | 6.0 | 0.0 | 0.0 | 0.0 | 0.0 | 4.636107 | 0.0 | 0.0 | 0.0 | 18.0 | 0.0 | 0.0 | 20.0 | 1.0 | 1.0 | 0.0 | 21.0 | 0.0 | benign |
| 2 | 20.0 | 0.0 | 4.0 | 16.0 | 16.0 | 2.0 | 0.0 | 0.0 | 0.0 | 0.0 | 3.708966 | 0.0 | 0.0 | 1.0 | 33.0 | 0.0 | 0.0 | 101.0 | 1.0 | 1.0 | 3.0 | 70.0 | 0.0 | benign |
| 3 | 27.0 | 0.0 | 13.0 | 14.0 | 14.0 | 3.0 | 0.0 | 0.0 | 0.0 | 0.0 | 4.025592 | 0.0 | 0.0 | 0.0 | 15.0 | 0.0 | 0.0 | 151.0 | 1.0 | 1.0 | 1.0 | 55.0 | 0.0 | benign |
| 4 | 39.0 | 2.0 | 12.0 | 27.0 | 27.0 | 2.0 | 0.0 | 0.0 | 0.0 | 0.0 | 4.631833 | 0.0 | 0.0 | 0.0 | 10.0 | 0.0 | 0.0 | 332.0 | 1.0 | 1.0 | 0.0 | 321.0 | 0.0 | benign |
| 5 | 18.0 | 0.0 | 0.0 | 18.0 | 18.0 | 2.0 | 0.0 | 0.0 | 0.0 | 0.0 | 3.943465 | 0.0 | 0.0 | 0.0 | 4.0 | 1.0 | 1.0 | 3.0 | 1.0 | 1.0 | 0.0 | 18.0 | 0.0 | benign |
| 6 | 49.0 | 0.0 | 30.0 | 19.0 | 19.0 | 4.0 | 0.0 | 0.0 | 0.0 | 0.0 | 4.251365 | 0.0 | 0.0 | 0.0 | 8.0 | 0.0 | 0.0 | 19.0 | 1.0 | 1.0 | 1.0 | 4.0 | 0.0 | malicious |
| 7 | 25.0 | 0.0 | 0.0 | 25.0 | 25.0 | 2.0 | 0.0 | 0.0 | 0.0 | 0.0 | 3.890320 | 0.0 | 0.0 | 0.0 | 22.0 | 0.0 | 0.0 | 333.0 | 1.0 | 1.0 | 1.0 | 155.0 | 0.0 | benign |
| 8 | 39.0 | 0.0 | 22.0 | 17.0 | 17.0 | 3.0 | 0.0 | 0.0 | 0.0 | 0.0 | 4.417174 | 0.0 | 0.0 | 0.0 | 17.0 | 0.0 | 0.0 | 32.0 | 1.0 | 1.0 | 2.0 | 29.0 | 0.0 | benign |
| 9 | 40.0 | 0.0 | 1.0 | 18.0 | 18.0 | 2.0 | 0.0 | 0.0 | 0.0 | 0.0 | 4.772055 | 0.0 | 0.0 | 0.0 | 3.0 | 0.0 | 0.0 | 18.0 | 1.0 | 1.0 | 0.0 | 2.0 | 0.0 | malicious |
| url_len | url_num_hyphens_dom | url_path_len | url_domain_len | url_hostname_len | url_num_dots | url_num_underscores | url_query_len | url_num_query_para | url_ip_present | url_entropy | url_chinese_present | url_port | html_num_tags('iframe') | html_num_tags('script') | html_num_tags('embed') | html_num_tags('object') | html_num_tags('div') | html_num_tags('head') | html_num_tags('body') | html_num_tags('form') | html_num_tags('a') | html_num_tags('applet') | label | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 3654 | 63.0 | 0.0 | 49.0 | 14.0 | 14.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 4.652737 | 0.0 | 0.0 | 0.0 | 1.0 | 0.0 | 0.0 | 2.0 | 1.0 | 2.0 | 1.0 | 0.0 | 0.0 | malicious |
| 3655 | 25.0 | 0.0 | 14.0 | 11.0 | 11.0 | 2.0 | 0.0 | 0.0 | 0.0 | 0.0 | 3.905639 | 0.0 | 0.0 | 0.0 | 14.0 | 0.0 | 0.0 | 36.0 | 1.0 | 1.0 | 0.0 | 41.0 | 0.0 | benign |
| 3656 | 126.0 | 0.0 | 46.0 | 17.0 | 17.0 | 2.0 | 5.0 | 62.0 | 2.0 | 0.0 | 5.025647 | 0.0 | 0.0 | 0.0 | 14.0 | 0.0 | 0.0 | 46.0 | 1.0 | 1.0 | 1.0 | 0.0 | 0.0 | malicious |
| 3657 | 42.0 | 0.0 | 21.0 | 21.0 | 21.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 4.148415 | 0.0 | 0.0 | 0.0 | 63.0 | 0.0 | 0.0 | 17.0 | 1.0 | 1.0 | 1.0 | 45.0 | 0.0 | benign |
| 3658 | 14.0 | 0.0 | 0.0 | 14.0 | 14.0 | 3.0 | 0.0 | 0.0 | 0.0 | 1.0 | 3.499228 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 2.0 | 1.0 | 1.0 | 0.0 | 1.0 | 0.0 | benign |
| 3659 | 68.0 | 3.0 | 16.0 | 52.0 | 52.0 | 2.0 | 0.0 | 0.0 | 0.0 | 0.0 | 4.135356 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 11.0 | 1.0 | 1.0 | 0.0 | 3.0 | 0.0 | malicious |
| 3660 | 66.0 | 0.0 | 48.0 | 18.0 | 18.0 | 2.0 | 0.0 | 0.0 | 0.0 | 0.0 | 4.362331 | 0.0 | 0.0 | 1.0 | 14.0 | 0.0 | 0.0 | 212.0 | 1.0 | 1.0 | 3.0 | 475.0 | 0.0 | benign |
| 3661 | 90.0 | 1.0 | 64.0 | 26.0 | 26.0 | 4.0 | 0.0 | 0.0 | 0.0 | 0.0 | 4.693343 | 0.0 | 0.0 | 0.0 | 13.0 | 0.0 | 0.0 | 75.0 | 1.0 | 1.0 | 2.0 | 103.0 | 0.0 | malicious |
| 3662 | 46.0 | 0.0 | 33.0 | 13.0 | 13.0 | 3.0 | 0.0 | 0.0 | 0.0 | 0.0 | 4.604166 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 4.0 | 1.0 | 1.0 | 0.0 | 3.0 | 0.0 | benign |
| 3663 | 18.0 | 0.0 | 0.0 | 18.0 | 18.0 | 2.0 | 0.0 | 0.0 | 0.0 | 0.0 | 3.619471 | 0.0 | 0.0 | 0.0 | 3.0 | 0.0 | 0.0 | 282.0 | 1.0 | 1.0 | 2.0 | 46.0 | 0.0 | benign |
Most frequently occurring
| url_len | url_num_hyphens_dom | url_path_len | url_domain_len | url_hostname_len | url_num_dots | url_num_underscores | url_query_len | url_num_query_para | url_ip_present | url_entropy | url_chinese_present | url_port | html_num_tags('iframe') | html_num_tags('script') | html_num_tags('embed') | html_num_tags('object') | html_num_tags('div') | html_num_tags('head') | html_num_tags('body') | html_num_tags('form') | html_num_tags('a') | html_num_tags('applet') | label | # duplicates | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 67 | 19.0 | 0.0 | 6.0 | 13.0 | 13.0 | 2.0 | 0.0 | 0.0 | 0.0 | 0.0 | 3.950064 | 0.0 | 0.0 | 0.0 | 6.0 | 0.0 | 0.0 | 38.0 | 1.0 | 1.0 | 1.0 | 18.0 | 0.0 | benign | 5 |
| 113 | 26.0 | 0.0 | 12.0 | 14.0 | 14.0 | 2.0 | 0.0 | 0.0 | 0.0 | 0.0 | 3.995715 | 0.0 | 0.0 | 0.0 | 7.0 | 0.0 | 0.0 | 51.0 | 1.0 | 1.0 | 1.0 | 198.0 | 0.0 | benign | 5 |
| 122 | 27.0 | 1.0 | 0.0 | 27.0 | 27.0 | 2.0 | 0.0 | 0.0 | 0.0 | 0.0 | 4.278352 | 0.0 | 0.0 | 0.0 | 9.0 | 0.0 | 0.0 | 200.0 | 1.0 | 1.0 | 2.0 | 122.0 | 0.0 | benign | 5 |
| 245 | 44.0 | 0.0 | 27.0 | 17.0 | 17.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 4.139468 | 0.0 | 0.0 | 0.0 | 2.0 | 0.0 | 0.0 | 3.0 | 1.0 | 1.0 | 1.0 | 0.0 | 0.0 | malicious | 5 |
| 5 | 13.0 | 0.0 | 0.0 | 13.0 | 13.0 | 2.0 | 0.0 | 0.0 | 0.0 | 0.0 | 3.684184 | 0.0 | 0.0 | 0.0 | 18.0 | 0.0 | 0.0 | 219.0 | 1.0 | 1.0 | 1.0 | 344.0 | 0.0 | benign | 4 |
| 8 | 13.0 | 0.0 | 0.0 | 13.0 | 13.0 | 2.0 | 0.0 | 0.0 | 0.0 | 0.0 | 3.884184 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 3.0 | 1.0 | 1.0 | 0.0 | 0.0 | 0.0 | benign | 4 |
| 23 | 16.0 | 0.0 | 0.0 | 16.0 | 16.0 | 2.0 | 0.0 | 0.0 | 0.0 | 0.0 | 3.653997 | 0.0 | 0.0 | 1.0 | 16.0 | 0.0 | 0.0 | 79.0 | 1.0 | 1.0 | 1.0 | 161.0 | 0.0 | benign | 4 |
| 46 | 17.0 | 0.0 | 0.0 | 17.0 | 17.0 | 2.0 | 0.0 | 0.0 | 0.0 | 0.0 | 4.053509 | 0.0 | 0.0 | 1.0 | 6.0 | 0.0 | 0.0 | 249.0 | 1.0 | 1.0 | 3.0 | 162.0 | 0.0 | benign | 4 |
| 47 | 17.0 | 0.0 | 0.0 | 17.0 | 17.0 | 2.0 | 0.0 | 0.0 | 0.0 | 0.0 | 4.084963 | 0.0 | 0.0 | 0.0 | 13.0 | 0.0 | 0.0 | 95.0 | 0.0 | 1.0 | 1.0 | 62.0 | 0.0 | benign | 4 |
| 66 | 19.0 | 0.0 | 0.0 | 19.0 | 19.0 | 2.0 | 0.0 | 0.0 | 0.0 | 0.0 | 4.008132 | 0.0 | 0.0 | 0.0 | 4.0 | 0.0 | 0.0 | 183.0 | 1.0 | 1.0 | 1.0 | 564.0 | 0.0 | benign | 4 |